Reputation-based Worker Filtering in Crowdsourcing

Authors

  • Srikanth Jagabathula
  • Lakshminarayanan Subramanian
  • Ashwin Venkataraman
Abstract

In this paper, we study the problem of aggregating noisy labels from crowd workers to infer the underlying true labels of binary tasks. Unlike most prior work, which has examined this problem under the random worker paradigm, we consider a much broader class of adversarial workers with no specific assumptions on their labeling strategy. Our key contribution is the design of a computationally efficient reputation algorithm to identify and filter out these adversarial workers in crowdsourcing systems. Our algorithm uses the concept of optimal semi-matchings in conjunction with worker penalties based on label disagreements to assign a reputation score to every worker. We provide strong theoretical guarantees for deterministic adversarial strategies as well as for the extreme case of sophisticated adversaries, where we analyze the worst-case behavior of our algorithm. Finally, we show that our reputation algorithm can significantly improve the accuracy of existing label aggregation algorithms on real-world crowdsourcing datasets.
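The idea of scoring workers by label disagreements and filtering before aggregation can be illustrated with a minimal sketch. This is a simplified disagreement-penalty heuristic under stated assumptions, not the semi-matching-based algorithm described in the abstract. The function names (`disagreement_penalties`, `majority_vote`), the input layout (a dict keyed by `(worker, task)` with labels `+1` or `-1`), and the rule that a worker on a conflicted task is penalized by the reciprocal of the number of workers who gave the same label are all illustrative assumptions.

```python
from collections import defaultdict


def disagreement_penalties(labels):
    """Average per-task penalty for each worker (illustrative heuristic).

    labels: dict mapping (worker, task) -> +1 or -1.
    Only tasks that received both labels (conflicted tasks) contribute.
    """
    plus = defaultdict(set)   # task -> workers who labeled it +1
    minus = defaultdict(set)  # task -> workers who labeled it -1
    for (worker, task), label in labels.items():
        (plus if label == 1 else minus)[task].add(worker)

    totals = defaultdict(float)
    counts = defaultdict(int)
    for task in sorted(set(plus) & set(minus)):
        for group in (plus[task], minus[task]):
            for worker in group:
                # Workers in a small camp receive a larger penalty.
                totals[worker] += 1.0 / len(group)
                counts[worker] += 1
    return {w: totals[w] / counts[w] for w in totals}


def majority_vote(labels, excluded=frozenset()):
    """Majority vote per task, ignoring excluded workers (ties go to +1)."""
    sums = defaultdict(int)
    for (worker, task), label in labels.items():
        if worker not in excluded:
            sums[task] += label
    return {t: (1 if s >= 0 else -1) for t, s in sums.items()}


# Toy data: workers a, b, c agree on every task; worker x flips every label.
labels = {}
for task, consensus in {"t1": 1, "t2": 1, "t3": -1}.items():
    for worker in ("a", "b", "c"):
        labels[(worker, task)] = consensus
    labels[("x", task)] = -consensus

penalties = disagreement_penalties(labels)
worst = max(penalties, key=penalties.get)           # highest-penalty worker
filtered = majority_vote(labels, excluded={worst})  # aggregate without that worker
```

In this toy setting the label-flipping worker ends up with the highest penalty and is dropped before voting. A real deployment would need a principled rule for how many workers to filter, which this sketch does not attempt.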

Similar resources

An Analytic Approach to People Evaluation in Crowdsourcing Systems

Worker selection is a significant and challenging issue in crowdsourcing systems. Such selection is usually based on an assessment of the reputation of the individual workers participating in such systems. However, assessing the credibility and adequacy of such calculated reputation is a real challenge. In this paper, we propose an analytic model which leverages the values of the tasks complete...

Identifying Unreliable and Adversarial Workers in Crowdsourced Labeling Tasks

In this paper, we study the problem of aggregating noisy responses from crowd workers to infer the unknown true labels of binary tasks. Unlike most prior work which has examined this problem under the probabilistic worker paradigm, we consider a much broader class of adversarial workers with no specific assumptions on their labeling strategy. Our key contribution is the design of a computationa...

Reputation-based Worker Filtering in Crowdsourcing

A. Proofs of the theorems. We first state a few helper lemmas. Lemma 1. Suppose the graph $G$ is an $(l, r)$-regular graph, i.e., the worker degree is $l$ and the task degree is $r$. Then, for each $(w_i, t_j) \in G$, the following is true: $\Pr(w_i(t_j) = +1) = \frac{1 + (2\gamma - 1)\mu}{2}$, $\Pr(w_i(t_j) = -1) = \frac{1 - (2\gamma - 1)\mu}{2}$, and $E[d_j^+] = r\,\frac{1 + (2\gamma - 1)\mu}{2}$, $E[d_j^-] = r\,\frac{1 - (2\gamma - 1)\mu}{2}$, where the probability and expectation are taken over th...

An incentive mechanism with privacy protection in mobile crowdsourcing systems

In order to improve the efficiency and utility of mobile crowdsourcing systems, this paper proposes an incentive mechanism with privacy protection in mobile crowdsourcing systems. Combining the advantages of offline incentive mechanisms and online incentive mechanisms, this paper proposes an incentive mechanism that selects the worker candidates statically, and then dynamically selects winners ...

Crowdworker Filtering with Support Vector Machine

Crowdsourcing has been recognized as a possible technique to complement costly user studies, usability studies, relevance judgment for information retrieval studies, and training set build-up for automatic document classification. However, the quality of crowdworkers varies by diverse factors and we often cannot tell whether their answers are right or wrong immediately due to the lack of gold s...

Publication date: 2014